Reconfigurable arithmetic for HPC

نویسندگان

  • Florent de Dinechin
  • Bogdan Pasca
چکیده

An often overlooked way to increase the efficiency of HPC on FPGA is to tailor, as tightly as possible, the arithmetic to the application. An ideally efficient implementation would, for each of its operations, toggle and transmit just the number of bits required by the application at this point. Conventional microprocessors, with their word-level granularity and fixed memory hierarchy, keep us away from this ideal. FPGAs, with their bit-level granularity, have the potential to get much closer. Therefore, reconfigurable computing should systematically investigate, in an application-specific way, non-standard precisions, but also non-standard number systems and non-standard arithmetic operations. The purpose of this chapter is to review these opportunities. After a brief overview of computer arithmetic and the relevant features of current FPGAs in Section 2, we first discuss in Section 3 the issues of precision analysis (what is the precision required for each computation point?) and arithmetic efficiency (do I need to compute this bit?) in the FPGA context. We then review several approaches to application-specific operator design: operator specialization in Section 4, operator fusion in Section 5, and exotic, non-standard operators in Section 6. Section 7 discusses the application-specific performance tuning of all these operators. Finally, Section 8 concludes by listing the open issues and challenges in reconfigurable arithmetic. The systematic study of FPGA-specific arithmetic is also the object of the FloPoCo project (http://flopoco.gforge.inria.fr/). FloPoCo offers open-source implementations of most of the FPGA-specific operators presented in this chapter, and

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reconfigurable Cloud

Cloud computing is emerging as an additional deployment option for some of the HPC applications. But still, it currently does not come without drawbacks for application performance. These performance issues take place due to resource contention, where one VM is able to impact the performance of another. This interference directly creates impact on performance and scalability of HPC application....

متن کامل

Performance Monitoring for Run-time Management of Reconfigurable Devices

High-performance computing (HPC) systems with hardware-reconfigurable devices have the potential to achieve major performance increases over parallel computing systems based solely on traditional processors. However, providing services upon which users of traditional HPC systems have come to depend is essential for largescale reconfigurable computing (RC) systems to become mainstream. Along wit...

متن کامل

Novo-G: A View at the HPC Crossroads for Scientific Computing

High-performance computing for many science domains is at a major crossroads. Conventional HPC is illequipped to address escalating performance demands without resorting to massively large, energy-hungry, and expensive machines. This paper focuses upon the principal challenges for HPC for scientific computing, why and how reconfigurable supercomputing is poised to make a major impact on acceler...

متن کامل

ARABICA: A Reconfigurable Arithmetic Block for ISA Customization

We propose a dynamically reconfigurable arithmetic block architecture for customizing embedded application processor instruction sets. Our architecture uses medium-grained arithmetic blocks and a dedicated but reconfigurable interconnection network to support a wide range of instruction-set extensions. Our experimental results demonstrate the performance of our arithmetic block compared to a ge...

متن کامل

Reconfigurable and adaptive photonic networks for high-performance computing systems.

As feature sizes decrease to the submicrometer regime and clock rates increase to the multigigahertz range, the limited bandwidth at higher bit rates and longer communication distances in electrical interconnects will create a major bandwidth imbalance in future high-performance computing (HPC) systems. We explore the application of an optoelectronic interconnect for the design of flexible, hig...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012